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Abstract 



An inclusive search is presented for new heavy particle pairs produced in \/s = 7 TeV 
proton-proton collisions at the LHC using 4.7 ± 0.1 fb _1 of integrated luminosity. The 
selected events are analyzed in the 2D razor space of Mr, an event-by-event indicator 
of the heavy particle mass scale, and R, a dimensionless variable related to the missing 
transverse energy. The third-generation sector is probed using the event heavy-flavor 
content. The search is sensitive to generic supersymmetry models with minimal as- 
sumptions about the superpartner decay chains. No excess is observed in the num- 
ber of events beyond that predicted by the standard model. Exclusion limits are de- 
rived in the CMSSM framework as well as for simplified models. Within the CMSSM 
parameter space considered, gluino masses up to 800 GeV and squark masses up to 
1.35 TeV are excluded at 95% confidence level depending on the model parameters. 
The direct production of pairs of stop or sbottom quarks is excluded for masses as 
high as 400 GeV. 
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Models with softly broken supersymmetry (SUSY) [1-5] predict heavy superpartners of the 
standard model (SM) particles. Experimental searches for R-parity flU conserving SUSY at 
the Tevatron [7, 8] and the Large Hadron Collider (LHC) |9l 43"0l have focused on signatures 
combining energetic hadronic jets and leptons from the decays of pair-produced squarks and 
gluinos, with large missing transverse energy (£™ 1SS ) from the two weakly interacting lightest, 
neutral superpartners (LSPs) produced in separate decay chains. 

In SUSY models, the scale of soft SUSY breaking is related to the scale of electroweak symmetry 
breaking. This implies either that the soft-breaking mass parameters cannot be too large, or that 
the smallness of the electroweak scale is explained by large cancellations arising from relations 
among these parameters in the high energy theory. The latter possibility is complicated by 
large radiative corrections, particularly those induced by the soft-breaking parameters that are 
responsible for the masses of the stops and sbottoms, the superpartners of the third-generation 
quarks. It is thus of special importance to search for the lightest allowed stops and sbottoms, 
whose decays will be enriched in heavy flavor. 

In this Letter we present results of an inclusive search for new heavy particles. The analysis 
is designed to be largely independent of the details of the decay chains and measures devi- 
ations from the characteristic distributions of the relevant SM processes in the razor variable 
plane |3T[ |3~2ll . It is generically sensitive to the production of pairs of heavy particles, pro- 
vided that the decays of these particles produce significant E™ 1SS , that these particles are sub- 
stantially heavier than any SM particle, and that they are strongly produced in high-energy 
proton-proton collisions. The selection requires only two or more energetic reconstructed cal- 
orimeter objects [33]. The selected events are sorted hierarchically into exclusive data samples, 
categorized according to the lepton multiplicity in the event. The analysis is repeated with 
the requirement of the presence of a bottom-quark jet (b-jet) to search for third-generation- 
enhanced SUSY signatures. While in Monte Carlo studies all SM processes are considered, the 
major backgrounds are top production and vector boson production in association with jets. 

The razor kinematic variables are based on the generic process of the pair production of two 
heavy particles, each decaying to an undetected particle plus visible decay products. Regard- 
less of its complexity, each event is treated as a dijet-like event by grouping all hadronic jet 
candidates and isolated electrons in the event into two megajets II32II . The razor kinematic vari- 
ables are used to test, event-by-event, the hypothesis that the megajet topology represents the 
visible portion of the decays of two heavy particles. Assuming the pair of megajets accurately 
reconstructs the visible portion of the parent particle decays, the signal kinematics is equiva- 
lent, for example, to pair production of heavy squarks qi, qz, with q,- — > where the x° are 
LSPs and denotes the visible products of the decays. 

The Mr razor kinematic variable is defined in terms of the momentum of the two megajets as 
Mr = [ ( | p h | + | p & | ) 2 — ( pi 1 + pi 2 ) 2 ] 1 /2 and is, by construction, invariant under longitudinal 
boosts. In the approximation of massless megajets and negligible initial-state pj, Mr equals 
7a Ma, where Ma = (M~ — M~)/Mq is twice the magnitude of the momentum of either mega- 
jet in the respective squark rest frame, and 7a is the boost factor from the center-of-mass frame 
to the squark rest frames. Note that this definition of Mr is amended from that in [32] to avoid 
configurations where the razor variable is ill-defined due to unphysical Lorentz transforma- 
tions. 

The razor observable M^ is defined as Mf = \ (^Ef iss (p'j + pf) - E™ iss -(p£ + p^)\ 1/2 , 

where pi' 2 are the transverse momentum vectors of the two megajets and E™ 1SS is the missing 
transverse momentum vector (also referred to as missing tranverse energy). The razor dimen- 
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sionless ratio is defined as R = . For signal events Mj has a maximum value (a kinematic 
endpoint) of Ma, so R has a maximum value of approximately one. Thus signal events are 
characterized by a distribution in Mr that peaks around Ma, and a distribution in R that peaks 
around 0.5, in stark contrast with, for example, QCD multijet background events, whose distri- 
bution in either R or Mr is exponentially suppressed away from zero |32j[33]|. These properties 
determine a region of the 2D razor space where the standard model background is reduced 
while the signal is retained. 

A detailed description of the CMS detector can be found elsewhere |34). A superconduct- 
ing solenoid provides an axial magnetic field of 3.8 T. The silicon pixel and strip tracker, the 
high-resolution crystal electromagnetic calorimeter (ECAL), and the brass/ scintillator hadron 
calorimeter (HCAL) are contained within the solenoid. Muons are detected in gas-ionization 
chambers embedded in the steel return yoke. The HCAL, combined with the ECAL, measures 
the jet energy with a resolution AE/E w 100 %/y/E/GeV 5 %. CMS uses a coordinate sys- 
tem with the origin located at the nominal collision point, and the pseudorapidity is defined 
as rj = — ln[tan(0/2)], where the polar angle 9 is defined with respect to the counterclockwise 
beam direction. 

The analysis uses a set of dedicated triggers that apply lower thresholds on the values of R and 
Mr computed online from the reconstructed jets and E™ 1SS . Three trigger categories are used: 
i) hadronic razor triggers applying threshold requirements |t33| on R and Mr in events with at 
least two jets of pj > 56 GeV; ii) muon razor triggers that have looser R and Mr requirements 
than the hadronic triggers and combined with at least one muon in the central part of the detec- 
tor (barrel) with pj > 10 GeV; iii) electron razor triggers with similar R and Mr requirements 
to those used for muons and with at least one electron of pj > 12 GeV satisfying loose isolation 
criteria. In addition, a set of non-razor triggers is used to define control data samples. 

Events, after detector- and beam-related noise cleaning, are required to have at least one high- 
quality reconstructed interaction vertex [35 1. When multiple vertices are found, the one with 
the highest associated yj trac k p\ is selected. The electron and muon candidate reconstruction 
and identification criteria are described in Ref. Ij36| . Electrons and muons are required to lie 
within \r]\ < 2.5 and 2.1, respectively, and to satisfy the identification and selection require- 
ments from [36 j. Jets are reconstructed from calorimeter energy deposits using the infrared- 
safe anti-fcx algorithm |37[ with radius parameter 0.5. Jets are corrected for non-uniformities 
of the calorimeter response using energy- and ^-dependent correction factors. Only jet candi- 
dates with pj > 40 GeV within \rj\ < 3.0 are retained. The jet energy scale uncertainty for these 
corrected jets is 5% M38| . To match the trigger requirements, the pj of the two leading jets is 
required to be greater than 60 GeV. The transverse momentum imbalance in the event, E™ ss , is 
reconstructed using the particle flow algorithm 1139 1 . 

The reconstructed jets are grouped into two megajets |33|. The megajets are constructed as a 
sum of the four-momenta of their constituent objects. After the baseline selection and calcula- 
tion of the variables R and Mr, the events are assigned to one of six final state boxes according 
to whether the event has zero, one, or two isolated leptons, divided according to lepton flavor 
(electrons and muons) as shown in Table [I] 

The requirements given in Table [j] define the full analysis regions of the R 2 -Mr plane, where the 
analysis is performed for each box. They are the loosest possible requirements that allow for the 
valid background description, while at the same time maintaining fully efficient triggers. To 
prevent ambiguities for events satisfying the selection requirements of more than one box (33j, 
the boxes are arranged in a predefined hierarchy, as given in Table [l] Each event is uniquely 
assigned to the first box whose criteria are satisfied by the event. 
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Table 1: Razor boxes definition. The variables and requirements are explained in the text. 



Lepton boxes Mr > 300 GeV, 0.11 < R 1 < 0.5 



ELE-MU 


pf > 20 GeV, p^ > 15GeV 


MU-MU 


p f j > 15 GeV, pf > lOGeV 


ELE-ELE 


pf > 20 GeV, pf > lOGeV 


MU 


p\ > 12GeV 


ELE 


p\ > 20GeV 



HAD box Mr > 400 GeV, 0.18 < R 2 < 0.5 



Six additional boxes are formed for events with at least one b-jet tagged using the Track- 
Counting High-Efficiency (TCHE) b-tagging algorithm with 1% misidentification rate l|40ll4T] |. 
These six boxes define the razor inclusive analysis of data samples with enhanced heavy-flavor 
content. 



The razor analysis is guided by studies of simulated events generated with the PYTHIA6 
and MadGraph [43J Monte Carlo programs, implemented using the CMS GEANT4-based 04] 
detector simulation, and then processed by the same software as that used to reconstruct data. 
Events with QCD multijets, top quarks, and electroweak bosons are generated with Mad- 
Graph interfaced with PYTHIA for parton showering, hadronization and the underlying event 
description. To generate Monte Carlo samples for SUSY, the mass spectrum is first calculated 
with SOFTSUSY (451 and the decays with SUSYHIT 06||. The PYTHIA6 program is used with the 
SLHA interface |l47l to generate the events. Next-to-leading order (NLO) plus next-to-leading- 
logarithm (NLL) cross section calculations are used |48-53|. 

For each of the main SM backgrounds, a control data sample is defined from a subset of the 
data dominated by this particular background. It is used to obtain a description of the shapes 
of the background components from data. In both simulation and control data samples, the dis- 
tributions of the major SM background events (QCD multijets, tt + jets, Z + jets, and W + jets) 
are found to have a simple exponential dependence on the razor variables R 2 and Mr over a 
large part of the R 2 -Mr plane. A full 2D SM background representation is built using statisti- 
cally independent data samples. This representation used as input to the final fit performed in 
each fit region (FR) defined by an L-shaped area in the R 2 -M R pi ane such as shown in Fig. 2 
The fit region is dominated by SM processes. Signal contamination has only a small impact on 
the determination of the background shape in the fit region, as demonstrated through studies 
based on simulation. The 2D background model obtained in the fit region is extrapolated to 
the rest of the R 2 -Mr plane where the analysis is sensitive to a potential SUSY signal. 

The fit function has parameters describing the shapes and normalization of the R 2 and Mr 
distributions of the SM backgrounds. The two-dimensional probability density function (pdf) 
R/(Mr,R 2 ) describing the R 2 versus Mr distribution of each considered SM process j is found 
to be well-approximated by instances of the same function Fj (Mr, R 2 ) : 

F ; (M R , R 2 ) = \kj(M R - M° R ,) (R 2 - R 2 ,) - 1 x 

e -fc 7 (M R -M^)(R 2 -R2.)_ 

The scaling of the exponent as a function of the thresholds on Mr and R 2 is described by the h 
parameters of the function. When integrated over Mr (R 2 ), this function recovers the exponen- 
tial dependence on R 2 (Mr). 

Each SM process in a given final-state box is well described by a pdf P; that is the sum of a first 
and a second component of the functional form Eq.[T]with separate normalizations. Studies of 
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simulated events and fits to control data samples with either a b-jet requirement or a b-jet veto 
indicate that the parameters corresponding to the first components of these backgrounds (with 
steeper slopes at low Mr and R 2 ) are box-dependent. The parameters describing the second 
component are box-independent, and at the current precision of the background model, they 
are identical among the dominant backgrounds considered in these final states. 

These sets of independent data control samples are used to derive a priori the background 
shape parameters. The results are incorporated in the final fits as a set of Gaussian penalty 
terms 11541 1551 for the parameters kj, M R ^ and R 2 , ■ multiplying the final likelihood (Eq.(2|). The 
RMS-values of the penalty terms for the h parameters are typically ~30%. 

An extended and unbinned maximum likelihood (ML) fit is performed in each box using 
ROOFlT ||55| . The fit performed in the fit region of the R 2 -Mr plane provides the description of 
the SM background in the full plane. The likelihood function for a given box is written as 



a = — Wf — n e m( M R,i' R i) > ( 2 ) 

iV ' i=X \jeSM J 

where N is the total event yield in the box, the sum runs over all the SM processes relevant for 
that box, and Nj is the yield of a given fit sample in the box. 

The values of the shape parameters that maximize the likelihood in these fits, along with the 
corresponding covariance matrix, are used to define the background model and the uncer- 
tainty associated with it. Additional background shape uncertainties due to the choice of the 
functional form were found to be negligible 11331 . 

The result of the ML fit projected on Mr and R 2 is shown in Fig. [I] for the HAD box. No 
significant discrepancy is observed between the data and the fit model in any of the six boxes 

HQ. 
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Figure 1: Projection of the 2D fit result on Mr (left) and R 2 (right) for the HAD box. The blue 
histogram is the total standard model prediction as obtained from a single pseudo-experiment 
based on the 2D fit. The red and green histograms represent a steep-slope component denoted 
as V + jets first component, and a component that encapsulates the steep-slope first component 
in tt + jets and the effective second component, which is indistinguishable for the different SM 
background processes. The fit is performed in the R 2 -Mr fit region (FR as shown in Fig.|2|| and 
projected into the full analysis region. Only the statistical uncertainty band on the background 
predictions is drawn in these projections. 



In order to establish the compatibility of the background model with the observed dataset, we 
define six signal regions (SRz) in the tail of the background distribution. Using the background 
model returned by the ML fit, we derive the distribution of the expected yield in each SRz using 
pseudo-experiments accounting for correlations and uncertainties on the parameters describ- 
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Figure 2: The p-values corresponding to the observed number of events in the HAD box signal 
regions (SRz). The green region indicates the fit region (FR) in the HAD box. Similar results are 
obtained for the other boxes. 



ing the background model. For each of the SRz' the distribution of the number of events derived 
by the pseudo-experiments is used to calculate a two-sided p-value (as shown for the HAD box 
in Fig. |2|, corresponding to the probability of observing an equal or less probable outcome for a 
counting experiment in each signal region. The p-values test the compatibility of the observed 
number of events in data with the SM expectation obtained from the background parameteri- 
zation. We quote the median and the mode of the yield distribution for each SR, together with 
the observed yield. 

For each box we consider the test statistic given by the logarithm of the likelihood ratio In Q = 
In C £^}]P / where H is the hypothesis under test; Hi (signal plus background) or the null hy- 
pothesis Ho (background-only). Given the distribution of In Q for background-only and signal- 
plus-background pseudo-experiments, and the value of In Q observed in the data, we calculate 
CL s+ b and 1 — Ct^, |57H58J. From these values the CL S = CL s+ t,/CLb is computed for that 
model point. A point in the constrained minimal supersymmetric standard model (CMSSM) 
plane is excluded at 95% confidence level (CL) if CL S < 0.05. The result is shown in Fig. [3] 
The shape of the observed exclusion curves reflect the changing relevant SUSY strong produc- 
tion processes across the parameter space with squark-squark and gluino-gluino production 
dominating at low and high mo, respectively. The observed limit is less constraining than the 
median-expected limit at lower tuq due to an excess of observed events in the HAD box at large 
R 2 , where squark-pair production dominates over gluino-pair production. Cascading decays 
of gluinos yield more leptons than decays of squarks. Thus, relative to hadronic boxes, the 
contribution of lepton boxes increases with mo. 

We estimate the systematic bias on the signal shape model due to parton density functions 
(point-by-point up to 30%), jet energy scale (point-by-point up to 1%) and lepton identification 
(using using Z — > £ + £~ data, 1% per lepton), and on the signal yield due to the luminosity 
uncertainty (2.2%) 1591 , the theoretical cross section (point-by-point up to 15%), razor trigger 
efficiency uncertainty (2%), and lepton trigger efficiency uncertainty (3%). In the b-tag analysis 
path an additional systematic is considered for the b-tagging efficiency (between 6% and 20% 
in px bins HOI). We consider variations of the function modeling the signal uncertainty (log- 
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normal versus Gaussian) as well as the R 2 and Mr binning choice, finding negligible deviations 
in the result. 

The results are also interpreted as cross section limits on a number of simplified models [60j, 
where a limited set of hypothetical particles and decay chains are introduced to produce a given 
topological signature. Specific applications of these ideas have appeared in Refs. ||6lVl63l . For 
each model studied, the excluded cross section at 95% CL is derived as a function of the mass 
of the produced particles (gluinos or squarks, depending on the model) and the LSP mass, as 
well as the exclusion curve corresponding to the NLL-NLO cross section. Exclusion curves 
for a factor of three cross section enhancement or reduction are also produced as well as for 
±1 standard deviation variations in the NLL-NLO cross section [33] . Figure [4] shows the 95% 
CL [|64| excluded largest parent mass as a function of the LSP mass in each of the simplified 
model studies, for both the inclusive and b-jet versions of the analysis. 




m [GeV] 



Figure 3: Observed (solid blue curve) and median-expected (dashed curve, shown with its ±1 
standard deviation uncertainty band) 95% CL limits in the (mo, OT1/2) CMSSM plane (drawn 
according to [65 1) with tan/3 = 10, Aq = OGeV, and sgn(^) = +1. Shown separately are the 
observed HAD-only (solid crimson) and leptonic-only (solid green) 95% CL limits. 
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Figure 4: Summary of the 95% CL excluded largest parent mass as a function of the LSP mass 
in each of the simplified models studied. Results from the inclusive razor analysis (upper bars) 
and the b-jet razor analysis (lower bars) are shown. 



In summary, we performed a search for squarks and gluinos using a data sample of 4.7fb 1 of 
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CMS data at -^/s = 7 TeV proton-proton collisions in the razor variable space using a 2D shape 
description of the relevant standard model processes. 

No significant excess over the background expectations is observed, and the results are pre- 
sented as a 95% CL limit in the (mo, m.1/2) CMSSM parameter space. For m(q) ~ m(g) we 
exclude squarks and gluinos up to 1.35 TeV, and for m(q) > m(g) we exclude gluinos up to 
800 GeV. For simplified models we exclude up to 1 TeV for the gluino mass and up to 800 GeV 
for the first and second generation squark masses. For direct production of pairs of stops or 
sbottoms we exclude stop and sbottom masses up to 400 GeV depending on the LSP mass. 
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